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CROSS-REFERENCE TO RELATED APPLICATION 
5 The present nonpro visional application claims priority from United 

States Provisional Patent Application 60/232,598 (Attorney Docket No. 
AM4938/T37300), filed September 14, 2000, which is hereby incorporated by 
reference. 

1 0 BACKGROUND OF THE INVENTION 

Semiconductor processing tools are highly complex devices. Their 
performance in real time is typically evaluated by monitoring values of a large number 
of tool sensors that reflect operational parameters such as temperature, pressure, and 
power. This tool evaluation may be conducted by comparing empirically measured 

1 5 sensor values to values predicted by a model. 

An example of a model for predicting the behavior of a semiconductor 
processing tool is the Universal Process Modeling (UPM) technique. This model was 
developed by Triant Technologies, Inc., of Nanaimo, British Columbia. 

Fig. 1 1 is a simplified schematic diagram illustrating the UPM 

20 technique. As shown in step A of Fig. 11, semiconductor processing tool 1101 is 
operated, and input vector 1 102 comprising values 1 104 for tool sensors 1106 is 
sampled at a time during the tool run (step A). Sampled input vector 1 102 is then 
compared with reference data library 1 1 12 (step B). Reference data library 1 1 12 
represents a compilation of vectors 1 150-1 157 from previous normal operation of 

25 semiconductor processing tool 1101. Vectors 1 1 50- 1 1 57 of reference library 1112 
include a value for each sensor of input vector 1 102. 

As a result of the comparison of step B, vector subset 1110 comprising 
vectors 1 150, 1 153, 1 156, and 1 157 is compiled from reference data library 1 112 
utilizing a nearest neighbor selection process between input vector 1 102 and the vectors 

30 of reference data library 1112 (step C). Vectors 1150, 1153, 1156, and 1157 ofvector 
subset 1110 reflect sensor values of previous normal operation of tool 1 101 that most 
closely resemble input vector 1 102. A variety of techniques may be employed in the 
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nearest neighbor selection process as known to those of skill in the art. Precise details 
of the nearest neighbor section process utilized by the UPM model are proprietary. 

Next, vectors 1150, 1153, 1156, and 1157 of subset 1110 are combined 
to produce a single output prediction vector 1116 (step D). Output prediction vector 
1116 reflects the state of semiconductor processing tool 1 101 in relation to previous 
normal operation. Output prediction vector 1116 may be communicated to the tool 
operator in several ways. For example, as shown in Fig. 11, values 1118 of individual 
sensors 1 120 of output prediction vector 1116 may be combined to produce a single 
fault detection index 1 1 14 that reflects the values of all of the tool sensors (step E). 
Alternatively, as shown in Fig. 12, values 1118 representing each individual sensor 
1120 of the output prediction vector may be plotted along spokes 1202 of "bull's eye" 
graph 1200, with radial distance 1204 representing deviation of the measured sensor 
value from expected values. 

One aspect of the UPM modeling technique just described is that it does 
not consider possible correlation between groups of related sensor values, such as 
related tool pressures, related tool temperatures, or related tool powers. Rather, all 
sensors are accorded equal weight in generating the fault detection index. This 
approach thus does not include potentially valuable correlation between related 
operational parameters that could provide more reliable fault detection information. 

Moreover, while the bull's eye graph of Fig. 12 provides the tool 
operator with an organized presentation of real-time tool operational parameters, the 
operator must still continuously monitor each of the tool sensors in order to detect a 
fault. Doing this for a large number of sensors may occupy the operator's attention, 
diverting him or her from other important tool management tasks. 

Another aspect of the modeling technique shown in Fig. 1 1 is that 
selection of nearest neighbor vectors to form the vector subset and the output prediction 
vector is based solely upon the sensor values. Other potentially relevant information, 
for example the time during the tool run at which the input vector is sampled, is not 
taken into account in the nearest neighbor selection process. This may affect the 
model's accuracy where the input vector and the library vector are similar merely by 
chance, for example where a temperature component of the input vector is measured at 
an early stage (ramp up) of a tool run, while the temperature component of the library 
vector is measured at a late stage (ramp down) of a tool run. In such a case values of 



the temperature component of the input and library vectors may be similar by chance, 
but the library vector is not otherwise an accurate prediction of the input vector. 

Accordingly, more sophisticated techniques for fault detection of 
semiconductor processing tools are desirable. 

SUMMARY OF THE INVENTION 
Embodiments of the present invention relate to methods for fault 
detection of a semiconductor processing tool. In particular, embodiments of the present 
invention relate to methods and apparatuses that employ tool modeling techniques that 
enhance the accuracy of fault detection. 

One embodiment of a fault detection method in accordance with the 
present invention comprises sensing a group of correlating operational parameters of a 
semiconductor processing tool operating under a recipe, and sensing non-correlating 
operational parameters of the tool operating under the recipe. An input vector is 
formed including the group of correlating operational parameters and the non- 
correlating operational parameters. The input vector is compared to a reference data 
library comprising vectors from previous tool runs utilizing the recipe. Nearest 
neighbor vectors to the input vector are selected from the reference data library based 
upon a similarity with only the group of correlating operational parameters. A vector 
subset is compiled from the selected nearest neighbor vectors. The vector subset is 
combined into an output prediction vector, and a fault detection index is generated from 
the output prediction vector. 

This and other embodiments of the present invention, as well as its 
advantages and features, are described in more detail in conjunction with the text below 
and attached figures. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1A is a simplified flow chart summarizing one embodiment of the 

method according to the present invention. 

Fig. IB is a schematic diagram illustrating operational details of the 

method shown in Fig. 1 A. 

Fig. 2A plots values for foreline pressure for an initial series of test runs 

of a plasma etching tool. 



Fig. 2B plots values for foreline pressure for an intermediate series of 

test runs. 

Fig. 2C plots values for foreline pressure for a later series of test runs. 
Fig. 3 plots the second fault detection index of TABLE 2 over a series of 

5 test runs. 

Fig. 4 plots the third fault detection index of TABLE 2 over the same 
series of test runs of Figure 3. 

Fig. 5 plots the fourth fault detection index of TABLE 2 over the same 
series of test runs of Figure 3. 
10 Fig. 6 plots the first fault detection index of TABLE 2 over the same 

series of test runs of Figure 3. 

Figs. 7A plots gas 1 inlet pressure for portion Z of the test runs of 

Figures 3-6 

Figs. 7B plots foreline pressure for portion Z of the test runs of Figures 

15 3-6. 

Figs. 7C plots throttle valve for portion Z of the test runs of Figures 3-6. 
Figs. 7D plots chamber pressure for portion Z of the test runs of Figures 

3-6. 

Fig. 8 A plots fault detection index versus time for a fault detection 
20 method which does not utilize cluster sensor information in the selection process of the 
nearest neighbor vectors. 

Fig. 8B plots fault detection index versus time for a fault detection 
method which utilizes cluster sensor information in the selection process of the nearest 
neighbor vectors. 

25 Fig. 9A plots actual and predicted susceptor temperature over time for a 

properly functioning etch tool. 

Fig. 9B plots actual and predicted susceptor temperature over time for a 
method in which susceptor temperature is an active sensor type. 

Fig. 9C plots actual and predicted susceptor temperature over time for a 
30 method in which susceptor temperature is a passive/inclusive sensor type. 

Fig. 10 shows a simplified view of an apparatus configured to practice 
an embodiment of a method in accordance with the present invention. 

Fig. 1 1 is a simplified schematic diagram illustrating the steps of the 
UPM technique. 
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Fig. 12 is a "bull's eye" graph of a conventional fault detection method. 



DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
As mentioned above, embodiments of the invention relate to methods 
5 and apparatuses that employ tool modeling techniques that enhance the accuracy of 
fault detection in semiconductor processing tools. In order to better appreciate and 
understand the present invention, reference is made to Figs. 1 A and IB and the 
accompanying text below. Fig. 1 A is a simplified flow chart showing the steps 
according to one embodiment of the method according to the present invention, while 

10 Fig. IB is a schematic diagram illustrating operational details of one example of the 

method shown in Fig. 1 A used in conjunction with detecting faults during the operation 
of a plasma etching tool 102. Reference to plasma etching tool 102 is made for 
exemplary purposes only. It is to be undestood that the present invention is applicable 
to detecting faults in other semiconductor fabrication tools including CMP tools, 

15 deposition tools and ion implant tools among others. 

Before utilizing the present invention to detect tool faults, a fault 
detection model must be developed. Developing the fault detection model includes 
associating various characteristics of tool 102 with sensors that are included in vectors 
of the model (Fig. 1A, step 2). As used herein, the term "sensor" refers to an 

20 operational parameter of the semiconductor fabrication tool. Sensors can be of 

different types, as is described in more detail below. As used herein, the term "sensor 
types" refers to the role played by a particular sensor value in modeling and fault 
detection processes. A particular sensor type may or may not represent a tool 
operational parameter that is measured in real time. 

25 In the example shown in Fig. IB, there are seven different sensors: 

chamber pressure sensor 106a, throttle valve pressure sensor 106b, foreline pressure 
sensor 106c, gas 1 pressure sensor 106d, out 1 sensor 122, cluster sensor 118 and 
susceptor temperature sensor 120. It is to be understood that the seven sensors 
illustrated in the example are for exemplary purposes only. A typical fault detection 

30 model according to the invention as employed in a production environment will include 
considerably more than seven sensors. 

After the various sensor are defined and assigned sensor types, data is 
collected during acceptable runs of the tool under a specific recipe and associated with 
the defined sensors (Fig. 1 A, step 4). Next, a model representing operation of the tool 



is constructed from the collected data (Fig. 1 A, step 6). The model includes a reference 
data library, as well as rules for producing an output vector and fault detection index 
from data of an input vector that is compared to the reference library. 

An example of a reference data library is shown in Fig. IB as library 
1 12. Reference data library 112 includes a plurality of reference data vectors 170-179 
compiled from the collected data. Each vector includes as components the sensors 
defined by the model, as well as the collected data that is associated with each sensor. 
Thus, each vector in library 1 12 of Fig. IB includes seven sensors. The data in library 
1 12 represents parameters of tool 102 during proper operation under a specific plasma 
etching recipe. Rules for producing the output vector and fault detection index from 
the model further include, among others, rules assigning weights to certain sensors and 
rules to decide comparison thresholds. 

Once library 1 12 is compiled and the fault detection model is completed, 
the model is ready to monitor tool 1 02 for faults in a production environment where 
tool 102 is operated under the specific recipe used to produce reference data library 
112. During such use, an input vector 104 is sampled from the tool (Fig. 1 A, step 8). 
Vector 104 includes the same sensors as the vectors in library 1 12. Once generated, 
vector 104 is input to the fault detection model in order to generate a fault detection 
index. 

As shown in Fig. 1 A, the fault detection index is created using a 
multistep process where first input vector 104 is compared to library 1 12 using a 
nearest neighbor approach to detect similar reference vectors (Fig. 1A, step 10). Next, 
the library vectors most similar to input vector 104 are compiled into a vector subset 
108 (Fig. 1A, step 12). An output prediction vector 1 16 is then generated from the 
vector subset (Fig. 1 A, step 14) and finally, a fault detection index 1 14 is generated 
from prediction vector 116 (Fig. 1A, step 16). Specific details concerning generation of 
fault detection index 1 14 during steps 10-16 according to one embodiment of the 
invention are discussed below with respect to Fig. IB. The generation of the fault 
detection index is based, in part, on applying different rules to different types of 
sensors. Thus, in order to better understand this example, a description of the different 
sensor types is in order. 

As previously mentioned, there are four different sensor types: active 
sensors, cluster sensors, passive/inclusive sensors and synthetic sensors. Active sensors 
are sensors that represent a group of correlating operational parameters. Embodiments 



of the invention group correlating active sensors together as one technique to improve 
fault detection. Sensor grouping enables a specialized fault detection index 1 16 to be 
calculated based upon tool operational parameters known to correlate with one another. 
Examples of correlating sets of tool operational parameters may include sets of tool 
5 temperatures or sets of tool pressures. Sensor grouping eliminates coincidences in tool 
data that could represent a fault, when in fact no tool fault has occurred. 

In the example of Fig. IB, it is known from operator experience that one 
group of correlating parameters of plasma etching tool 102 relates to pressure. 
Accordingly, chamber pressure sensor 106a, throttle valve pressure sensor 106b, 

10 foreline pressure sensor 106c and gas 1 pressure sensor 106d are all classified as active 
sensors. As explained in detail below, active type sensors are utilized in the selection 
process of the nearest neighbor vectors to compile a related vector subset, and they are 
also included in generating the fault detection index from the output prediction vector. 

Another technique used by embodiments of the invention includes 

1 5 refining the nearest neighbor selection process by including criteria other than 

similarity in the value of vector components. For example, Fig. 9A plots predicted and 
measured values for susceptor temperature of a plasma etching tool that is functioning 
normally. Susceptor temperature is cyclical over time, with an increase in susceptor 
temperature during first stage 900 of the etch process followed by a decline in susceptor 

20 temperature during second stage 902 of the etch process. 

Embodiments of the invention reference the stage of the plasma etching 
process when the input vector is measured in the selection of nearest neighbor vectors. 
For example, as shown in Fig. 9A susceptor temperature 904 of an input vector 
measured during first stage 900 of the etch process may be the same as susceptor 

25 temperature 906 of a library vector measured during second stage 902 of the etch 

process. In such a case, while susceptor temperatures 904 and 906 may happen to be 
the same, the library vector is otherwise not an accurate prediction of the input vector. 
Accordingly, embodiments of the invention employ a cluster type sensor, such as 
sensor 1 18, to represent the stage in the plasma etching process of tool 102 at which 

30 input vector 104 is sampled. As described in detail below, cluster type sensor values 
are included in the nearest neighbor selection process, but are not included in 
calculating the fault detection index. 

A third type of sensor, the passive/inclusive sensor type, is a sensor that 
represents a tool operational parameter that does not correlate strongly with the active 
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type sensors, but which is expected from operator experience to provide important tool 
fault information. An example of a passive/inclusive sensor type is susceptor 
temperature sensor 120 in Fig. IB. The susceptor temperature of plasma etching tool 
102 does not correlate strongly with the active (pressure) sensors 106a-d, but it is 
5 known from user experience that susceptor temperature is a key fault indicia for the 
particular etch process being monitored. Thus, embodiments of the invention include 
passive/inclusive sensor 120 in calculation of a fault detection index, even where the 
active type sensors used to predict the output vector comprise a set of correlating tool 
pressures. Passive/inclusive type sensors are not utilized in the nearest neighbor 

10 selection process, but are utilized in generating the fault detection index. 

Finally, a fourth class of sensor is the synthetic sensor type, which may 
be employed to expand the predictive capability of the model. Specifically, synthetic 
type sensors represent physical quantities that cannot be measured during real time 
operation of the tool, but which can be measured and assigned to a particular vector 

15 upon completion of the semiconductor process. In the specific example of Fig. IB, a 
synthetic sensor type 122 represents a critical dimension (CD) of a semiconductor 
feature resulting from plasma etching. The CD cannot be measured during etching, but 
can be measured after etching and then be included as a component of a library vector. 
Subsequent selection of nearest neighbor vectors would result in prediction of a critical 

20 dimension. 

Synthetic sensor types are merely predicted by the model, and are not 
utilized in the selection process of the nearest neighbor vectors or in generating the 
fault detection index. Input of the synthetic type sensor 122 is not required by the 
model and input vector 104 includes a null value for synthetic type sensor 122. 
25 However, as discussed below, the output vector produced by the model includes a 
predicted value for the synthetic type sensor 122. 

For reference, a general summary of the role of each sensor type is listed 
below in TABLE 1. 
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TABLE 1 



SENSOR 
TYPE 


EXAMPLE 


INPUT 
REQUIRED? 


VALUE 
PREDICTED 
BY MODEL? 


CORRELATION 
WITH OTHER 
ACTIVE 
SENSORS? 


USED TO 
SELECT 
VECTOR 
SUBSET? 


USED TO 
CALCULATE 

FAULT 
DETECTION 

INDEX? 


Active 


chamber 
pressure 


Yes 


Yes 


Yes 


Yes 


Yes 


Passive/ 
Inclusive 


susceptor 
temperature 


Yes 


Yes 


No 


No 


Yes 


Synthetic 


out 1 


No 


Yes 


Not Applic. 


No 


No 


Cluster 


stage 


Yes 


No 


Not Applic. 


Yes 


No 



Returning to Fig. 1 A, once the input vector has been sampled from the 
semiconductor fabrication tool, the next step is to compare the input vector to the 
5 reference library compiled during prior normal operation of the tool. Referring back to 
the nearest neighbor algorithm of step 10, the example shown in Fig. IB employs two 
steps to gauge similarity between input vector 104 and vectors 170-179 of reference 
data library 112. In a first test, input vector 104 is compared with reference data library 
112 and similarity factor 124 is generated for each of vectors 170-179. Similarity 

10 factor 124 reflects similarity to active type sensors 106a-d of input vector 104. One 
approach to generating similarity factor 124 may be through the operation of the 
proprietary UPM software program previously described. Vectors having a similarity 
factor 124 exceeding a cut-off value (0.97 in the example of Fig. IB) pass the first test. 
The values of passive/inclusive type sensor 120 and synthetic type sensor 122 are not 

15 included in calculating similarity factor 124. As shown in Fig. IB, vectors 170, 172, 
173, 177, and 178 pass the first test. 

In the second test, cluster type sensor 1 18 of input vector 104 is 
compared with cluster type sensors of the vectors of reference data library 112. In the 
example shown in Fig. IB, this comparison test requires identity between the cluster 

20 sensors (i.e., only reference data library vectors at stage 1 pass the second test). 

Alternatively, this comparison test may require less than an exact match between the 
cluster sensor types (i.e., where the cluster sensor is quantified in units of seconds 
rather than "stages", library vectors measured within a specific range of seconds of the 
input vector may pass the second test). As shown in Fig. IB, library vectors 170, 172, 

25 177, and 178 also pass the second test. Vector 173 fails the second test because of non- 
identity between its cluster type sensor component and that of input vector 104. 
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Next in step 12, vector subset 108 is compiled from library vectors 170, 
172, 177, and 178 satisfying both the first and the second tests. The vectors of subset 
108 include predicted values for the active, cluster, passive/inclusive, and synthetic 
sensor types. 

5 In step 14, vectors 170, 172, 177, and 178 of vector subset 108 are 

combined to produce an output prediction vector 116. In the specific example shown in 
Fig. IB, output prediction vector 1 16 is generated by assigning a first set of weights 
126 to vectors 170, 172, 177, and 178, and then combining the weighted vectors. 
Alternatively, this vector combination step could utilize no weighting at all and assign 

10 equal weight to each vector of the subset. As yet another alternative, the combination 
could utilize a weighing system that considers similarity factor 124 previously 
generated for each vector. 

In step 16, fault detection index 1 14 is generated from output prediction 
vector 116. Sensor ranking is an additional technique that may be employed to 

15 improve the accuracy of fault detection. Active and passive/inclusive type sensors of 
vector 116 may be ranked in terms of their relative importance in indicating a tool fault 
utilizing second set of weights 128. The weighted sensor values can then be combined 
to produce fault detection index 114. The values of synthetic type sensor 122 and 
cluster type sensor 1 1 8 are not included in this calculation. 

20 Experimental results of fault detection in accordance with the present 

invention are now described below in connection with Figures 3A-9C. 



Experimental Results 

I. Sensor Grouping 

25 Plasma etching of oxide layers is frequently performed during the 

formation of vias in integrated circuits. As a result of oxide etching, polymer materials 
may accumulate that interfere with correct positioning of the wafer. This can affect 
product yields. Ordinarily the polymer residues are removed by periodic cleaning 
steps. 

30 To evaluate a fault detection method in accordance with the present 

invention, a series of over 1400 consecutive oxide plasma etching runs utilizing an 
Applied Materials Centura® plasma etching device were performed, without any 
intervening cleaning steps. Tool sensor information was collected and modeled 
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utilizing Modelware/RT software manufactured by Triant Technologies, Inc. of 
Nanaimo, British Columbia. 

Residual values represent the difference between an actual measured 
value from the tool and a value predicted by the model. Figs. 2A-2C plot the residual 
value of foreline pressure, at the beginning, middle, and end of the series of processing 
runs. Fig. 2A plots values for residual foreline pressure for a beginning portion the 
1400 consecutive test runs. Fig. 2 A shows that values for residual foreline pressure at 
first remained comfortably within expected tolerance region X. 

Fig. 2B plots values for residual foreline pressure for an intermediate 
portion of the test runs. Fig. 2B shows a drift in the residual value of foreline pressure 
to the lowermost portion of tolerance region X. 

Fig. 2C plots values for foreline pressure for a later portion of the test 
runs. Fig. 2C shows that the drift in the residual value of foreline pressure indicated in 
Fig. 2B has continued, such that the foreline pressure is now out of the tolerance region 
X. With conventional fault detection methods, this trend would likely alarm the tool 
and halt wafer production. However, correlation of the foreline pressure data with 
other tool information, and hence the actual existence of a fault, would remain 
undetermined. 

Therefore, the state of the plasma etching tool over the same series of 
1400 consecutive processing runs was examined utilizing four specialized fault 
detection indices, each including different active sensor groupings. The active sensors 
for each fault detection index are listed below in TABLE 2. 
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TABLE 2 



ACTIVE 


FIRST 


SECOND 


THIRD 


FOURTH 


SFTVSOR 


TTATTT T 


TT A T TT 'I' 


FAULT 


FAULT 


NO. 


DETECTION 


DETECTION 


DFTFCTTniV 


FIT? TF f~"TTOT\T 




i \ n f \ 




ttvttmt'v 


INDEX 


1 


chamber 
pressure 


temp eratur e 


DldS 


chuck current 


2 


throttle valve 
position 


r VmpV 

temperature 


lui wdiu puwer 


— r — ; ; 

chuck voltage 


3 


foreline 


heat exchanger 


reflected power 


inner helium 




pressure 


temp erature 




flow 


4 


gas 1 inlet 


cathode 


load blade 


outer helium 




pressure 


temperature 


position 


flow 


5 


gas 2 inlet 


none 


tune blade 


inner helium 




pressure 




position 


pressure 


6 


none 


none 


none 


outer helium 
pressure 



TABLE 2 illustrates the reduction in data offered by sensor grouping. Specifically, 
information of a total of 20 sensors is condensed into just four fault detection indices. 
5 The enhanced precision of fault detection offered by the indices of 

TABLE 2 is discussed below in conjunction with Figs. 3-7D. Fig. 3 plots the second 
fault detection index over the entire processing run of >1400 wafers. This second fault 
detection index, which focuses upon temperature, does not reveal a fault in the plasma 
etching tool. 

1 0 Fig. 4 plots the third fault detection index over the processing run. 

Apart from a single fault at around wafer 200, this third index focusing upon power 
also does not reveal a tool fault. The spike at around wafer 200 was associated with 
etching of a bare silicon wafer lacking an oxide layer that was inadvertently placed into 
the etching tool. 

15 Fig- 5 plots the fourth fault detection index over the processing run. 

This third fault detection index, which focuses upon wafer chucking, reveals an initial 
fault at around wafer 600, followed with increasingly frequent faults. The fourth fault 
detection index indicates a likely problem with chucking of the wafer about halfway 
through the 1400+ test runs. This was likely due to the buildup of residues due to an 

20 absence of periodic cleaning steps. 

Fig. 6 plots the first fault detection index over the processing run. This 
first fault detection index, which focuses upon pressure, reveals a fault at about wafer 
200, corresponding to processing of the bare silicon wafer. Moreover, Fig. 6 also 
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reveals an increase in the incidence of tool fault after about wafer number 600. This 
trend in the first fault detection index further indicates a tool fault. 

In order to more specifically examine the causes of the tool fault 
indicated by Figures 5 and 6, Figs. 7A-7D plot values for several of the active sensors 
5 of the first fault detection index during portion "Z" occurring late in the series of 
processing runs shown in Figs. 3-6. 

Fig. 7 A plots the gas 1 inlet flow. Fig. 7A indicates that the gas 1 flow 
inlet pressure remained within tolerance band X. The fault indicated by the fourth fault 
detection index was thus not attributable to gas 1 inlet flow pressure. 
10 Fig. 7B plots foreline pressure sensor for same portion Z of Fig. 7A. 

Fig. 7B indicates that foreline pressure was out of tolerance band X. 

Fig. 7C plots the throttle valve sensor for the same portion Z of Fig. 7A. 
Fig. 7D plots the chamber pressure sensor for portion Z. Figs. 7C and 7D reveal that 
the throttle valve and chamber pressure lie substantially out of acceptable tolerance 
15 bandX. 

The value of the sensor readings of Figs. 7A-7D, and their correlation to 
one another, confirm the existence of a tool fault as initially indicated by the decline in 
foreline pressure shown in Figs. 2A-2C. When combined, the throttle value, foreline 
pressure, and chamber pressure active sensor values account for the decline in the 
20 fourth fault detection index, and provide a more accurate picture of the status of the 
plasma etching tool than any single sensor considered independently. 

B. Cluster Sensor 

While the above experimental results illustrate the impact of sensor 
25 grouping in revealing the existence of a tool fault, utilization of the cluster sensor also 
enhances the accuracy of fault detection. 

Fig. 8 A plots a fault detection index versus time for a process run of a 
plasma etching tool experiencing a tool fault. The method of Fig. 8A considers the 
value of the cluster sensor in the selection process of the nearest neighbor vectors to 
30 compile the vector subset. Fig. 8 A represents a highly accurate picture of the status of 
the tool, with the fault detection index progressively worsening from time Ti until the 
tool is ultimately halted and alarmed at time T 2 . 

By contrast, Fig. 8B plots the same process run where the value of the 
cluster sensor is not included in the selection process of the nearest neighbor vectors to 
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compile the vector subset. Fig. 8B represents a less accurate picture of the status of the 
tool, with an abrupt drop in the fault detection index occurring at time T 3 immediately 
prior to alarming and halting of the tool at time T 4 . The advance warning of a fault 
provided by the method of Fig. 8A would afford the tool operator significant lead time 
5 to recognize and correct a fault. Early fault recognition would thus reduce the number 
of wafers ultimately falling outside of acceptable tolerances, raising tool yields. 

C. Passive/Inclusive Sensor 

Utilization of the passive/inclusive sensor type also provides benefits in 
10 fault detection. For example, an operational parameter such as susceptor temperature 
may provide important fault detection information. However, susceptor temperature 
does not correlate strongly with the related active pressure sensors of the first fault 
detection index of TABLE 2. 

If the susceptor temperature is defined as an active sensor, it is included 
1 5 in the nearest neighbor selection process. However, this is undesirable where the tool 
performance is to be modeled based upon other strongly correlating sensors. The role 
of the passive/inclusive sensor type is best understood in conjunction with FIGS. 9A- 
9C below. 

Fig. 9A plots measured susceptor temperature and the predicted 
20 susceptor temperature range over time, for a properly functioning etch tool. Both 
measured and predicted susceptor temperature are cyclical between times Ti and T 2 . 

Fig. 9B plots measured susceptor temperature and the predicted 
susceptor temperature range over time, in a predictive model in which the susceptor 
temperature is an active sensor type. Fig. 9B indicates that at time Ti the tool 
25 experiences a fault and the measured susceptor temperature departs from its expected 
cyclical behavior, remaining constamt through T 2 . Susceptor temperature does not 
correlate strongly with the group of related active pressure sensors. However, because 
susceptor temperature is an active sensor type and is considered in selection of the 
nearest neighbor vectors, the vector subset predicted by the model reflects the flat shape 
30 that conforms to the change in actual temperature. This change in predicted 

temperature masks the tool fault indicated by the constant flat profile of measured 
temperature between times Ti and T 2 . 

By contrast, Fig. 9C plots the measured susceptor temperature and a 
predicted susceptor temperature range over time, in a predictive model in which 
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susceptor temperature is a passive/inclusive sensor type. Like the tool of Fig. 9B, the 
tool of Fig. 9C experiences a fault at time TV However, because the measured 
susceptor temperature is not included in selecting the nearest neighbor vectors of the 
subset, the predicted susceptor temperature range does not remain flat at time Ti to 
match the measured value. Rather, in Fig. 9C the flat profile in actual susceptor 
temperature beginning at time T] passes outside of the predicted range and is thus 
recognizable as a tool fault. 

Apparatus for Fault Detection 

Fig. 10 shows a simplified view of an apparatus configured to practice 
an embodiment of a method in accordance with the present invention. Apparatus 1000 
includes semiconductor fabrication tool 1002 including wafer processing chamber 1004 
having wafer 1006 positioned therein. Apparatus 1000 also includes first sensor 1008 
and second sensor 1010 operatively coupled to tool 1002 to detect operational 
parameters of tool 1002. 

Controller 1012 is in communication with tool 1002, and with first and 
second sensors 1008 and 1010. Memory 1014 is coupled to controller 1012, and 
memory 1014 stores a computer program in computer readable format including 
computer instructions to control said apparatus to sense a group of correlating 
operational parameters from the semiconductor processing tool operating under a 
recipe. The computer instructions also control said apparatus to compare the group of 
correlating operational parameters to corresponding operational parameters recorded 
during prior runs of the tool utilizing the recipe, and to generate a fault detection index 
from comparing the sensed group of operational parameters to the corresponding 
operational parameters recorded during prior runs. 

While the present invention has been described in Fig. IB in conjunction 
with detecting a fault in a plasma etching tool, the semiconductor fabrication tool of 
Fig. 1 0 is not limited to this specific embodiment. In accordance with one alternative 
embodiment of the present invention, a fault in a chemical-mechanical polishing (CMP) 
apparatus could be detected. In accordance with other alternative embodiments of the 
present invention, faults in other semiconductor fabrication tools such as chemical 
vapor deposition tools and lithography apparatuses could be detected. 

In addition, the above invention has been described in conjunction with 
a fault detection method utilizing selection of nearest neighbor vectors from a reference 
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data library, this is not required by the present invention. The technique of grouping 
together sensors that correlate with one another to create fault detection indices could 
be utilized in other modeling approaches, and the method would still remain within the 
scope of the present invention. 

Given the above detailed description of the present invention and the 
variety of embodiments described therein, these equivalents and alternatives along with 
the understood obvious changes and modifications are intended to be included within 
the scope of the present invention. 
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